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Abstract 

A common framework for quantum mechanics, thermodynamics and information theory is 
presented. It is accomplished by reinterpreting the mathematical formalism of Everett's many- 
worlds theory of quantum mechanics and augmenting it to include preparation according to 
a given ensemble. The notion of directed entanglement is introduced through which both 
classical and quantum communication over quantum channels are viewed as entanglement 
transfer. This point is illustrated by proving the Holevo bound and quantum data processing 
inequality relying exclusively on the properties of directed entanglement. Within the model, 
quantum thermodynamic entropy is also related to directed entanglement, and a simple proof 
of the second law of thermodynamics is given. 

We present a novel framework for quantum mechanics which we argue to be more conducive 
to the development of quantum information theory. Our motivation is both practical and one of 
principies. On the practical side, we find the Standard Copenhagen framework to be unsatisfac- 
tory in various ways. For example, there are many different reasons for a quantum system Q to 
be described by some mixed density operator pç. It could be prepared from some ensemble of 
pure states, it could be a result of an unobserved measurement performed on a pure state, it could 
be entangled with some other physical system, or it could be some combination of these three. A 
related issue is that classical and quantum communication over quantum channels appear to be 
rather independent problems pí. Another dichotomy is that two protagonists may share informa- 
tion about a physical event through communication rather than through common observation, yet 
these are treated very differently. 

Regarding principies, we would likc to view the laws of physics in information theoretical terms. 
This attitude may be traced to Wheeler's motto "It from Bit" and Wigner's observation that 
physics merely describes correlations between events H . The two are unified by the statement that 
the totality of conceptual experience can be described in terms of correlated random variables] this 
will allow us to make contact with Shannon's information theory in which random variables are 
the carriers of information. For instance, two protagonists sharing the same physical world is no 
more than classical correlations between the states of their knowledge regarding that world. Simi- 
larly, the observation of definite physical laws is no more than classical correlations between states 
of knowledge regarding two consecutive acts of measurement, or preparation and measurement, 
depending on the experiment. A ball kicked by Alice seen as obeying Newton's deterministic laws 
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of motion is merely a statement about the correlation between her knowledge of its initial vclocity 
(preparation) and that of its position when it hits the ground (measurement). 

The model we present here curtails the deficiències of the Copenhagen interpretation men- 
tioncd above, while embodying this information-theoretical principle. Perhaps surprisingly, this 
can be accomplished by reinterpreting the mathematical formalism of a radically different theory of 
quantum mechanics, Everett's Many-Worlds interpretation |J, and augmenting it to include prepa- 
ration, an indispensable ingredient for describing communication [Q. In our model the universe 
U is divided into subsystems which can either belong to the set of "physical" entities V or to the 
set of instances of knowledge JC. The set JC partly corresponds to what Everett called "observers", 
by which he essentially meant another physical system, such as a quantum computer or the brain. 
Assuming that observers are entirely "physical" leads to the unsolved basis problem ||. Here we 
do not imbue JC with a "physical" interpretation; it merely keeps track of the conceptual experience 
of the observer/preparer in relation to what has been observed/prepared. We pause to justify the 
unconventional step of including conceptual experience in a physical model. Everett points out in 
that physical theories have always consisted of two parts, the "formal" mathematical part and 
the " interpretive" part: a set of rules for connecting the mathematical formalism to our conceptual 
experience. He warns that the unrigorous nature of the second part often leads to a successful 
model being mistakenly identified with "reality." It is thus the conceptual experience that is the 
subject of physics, and including it in the model itself can only help avoid the above pitfall. 

The set JC is divided into subsets associated with particular protagonists, such as JCaiícc and 
K-Bob- At any given time the universe is described by a pure state \^)u- The statc of some 
subsystem A is described by the density matrix obtained from \^)u by tracing out the rest of the 
universe U/A: 

PA = tr u/A \*){*\ u . (1) 

The Hilbert space of U is constantly being augmented by new instances of knowledge, initially in 
some fixed pure state (although they immediately get entangled with already existing memebers 
of U; as pure states they serve no function). Otherwise, 1^)^ can only evolve from one moment to 
the next according to some unitary operator U 



|*> M — ► üï*>u, (2) 

possibly entangling the different subsystems. These are the building blocks of the theory. Now we 
illustrate how measurement, preparation and communication are described in terms of it. 

(i) Measurement. The measurement process à la Everett is described as follows. Bob wishes 
to perform an elementary measurement on some m-dimensional physical system L in the orthonor- 
mal basis Denote by the reference system R that subsystem of the universe U which is 

entangled with L, so that RL is in a pure state 

77 1 

\^)RL = J2 X ^)R\j)L- 

The unobserved measurement consists of the measurement apparatus M, initially in some pure 
state, becoming entangled with RL via some unitary operator acting on LM only. The state of 
RLM becomes 

m 

\^)blm = Y^^Mí)h\3)l\3)m, 

3=1 

where {|j)m} is an orthonormal basis for M. The observed measurement consists of the production 
of an m-dimensional system B € JCboò m some pure state, followed by a unitary transformation 
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acting on MB only. This results in 

m 

\^)RLMB = ^\j\<Pj)B.\j)L\j)M\j)B, 
j=l 

where \j)b form an orthonormal basis for thc Hilbert space Tíb of B. These should be thought 
of as shorthand notation for the mutually exclusive states of Bob's knowledge with respect to the 
obscrvation of M, = |observe M in the state \j)m)b- The density matrix ps of B, viewed in 
the \j) b basis, has diagonal elements |Aj| 2 . We define the associated random variable B as 

Pr(B=j) = (j\p B \j) B = \X j \ 2 . 

The Shannon entropy of B is defined as i/(B) = — J2j Pffò = j) l°g Pr(B = j). Thc crucial point 
is the following. We started off by modelling the mutually exclusive states of Bob's knowledge by an 
orthonormal basis for B. If ps were diagonal in this basis, it would have the natural interpretation 
of B being in the state \j) B with probability Pr(B = j), and thus isomorphic to the random 
variable B. However, in general off-diagonal elements do exist; the theory then postulates that 
Bob is blind to this fact, since it is not in accord with his classical probabilistic vision, referred to 
henceforth as "diagonal vision." This is, of course, just a manner of speech; all that is being said 
is that only the diagonal clements have an intepretation. 

One might think that if Bob's expcrience is not based in "rcality" there should be a discrepancy 
between his experience and that of othcrs. This is not the case. If his friend Charlie takes a look at 
the readout of M, a new system C will be produced in such a way that thc joint system RLMBC 
is now in the state 

\^)rlmbc = r\Í) l\í) m\í) b\í)c , 

J=l 

with the \j)c defined analogously to \j) B - To compare Bob's and Charlie's experiences we restrict 
attention to the diagonal clements of the joint density matrix p B c associated with the joint random 
variable BC. It can be easily verified that B and C are perfectly correlated, namely Pr(B = C) = 
1. In information theoretical terms, we have 

J(B;C) = if(B) = if(C), 

where /(B; C) = H(B) + H(C) — íf (BC) is the mutual information between B and C. The same 
happens when Charlie does not observe the readout of M, but instead measures L in the same 
basis w ith his own apparatus N. Bob can also perform a generalized measurement on some 

physical system Q. This is done by entangling Q with L via some unitary operator acting on QL, 
and subsequently performing an elementary measurement on L, as described above. Note that 
the only mathematical difference between our treatment of measurement and Everett's lies in our 
focus on density matrices in JC as the source of random variables. However, preparation, our next 
tòpic, is altogether new in this context, 
(ii) Preparation. 

Alice wishes to prepare some Q £ V according to an ensemble of pure states {(pi, |^í)q) : i = 
1, . . . n}, Y^i=i Pi = 1> by which we mean that the state \4>%)q is prepared with probability pt. This 
can be accomplished by 

1) mcasuring Q and bringing it into some fixed state |0)q, thereby disentangling it from its 
original reference system. 

2) performing conditional unitary operations according to the ensemble {(pi, Ui) : i = 1, . . . n}, 
such that \iI>í)q = Uí\0)q. 

We omit the description of process 1); suffice it to say that the end result is the system AiQ 
being in the state 

l*>A 1 g = |0)A 1 |0) O) 



3 



where A\ € ICaiícc and |0) J 4 1 represents Alice knowing that Q is in the state |0)q. Now the outcome 
of process 2) is the creation of A2 £ IC Alice so that the state of A\QA^ is 

n 

\^)a iQ a 2 = \Q)ai VpÏ\Í>í)q \í)a* , (3) 
ï=i 

where the basis state \i) a 2 stands for Alice knowing that she has performed the unitary operation 
Ui. Finally A € )C Alice is produced giving rise to 

n 

\^)a iQ a 2 a = |0)^ VpÏ\^í)q\í)a 2 \í)a, (4) 
í=i 

where the basis state \i) a represents Alice knowing that Q is in the state \ÍH)q after the application 
of the conditional unitary transformation. The transition from (|J) to (||) resembles a primitive 
quantum computation 

Upon preparation, the system Q is in the mixed state pç — X)íLií , í|V'í)(V , í|q· The density 
matrix pa of A, viewed in the \Ï)a basis, has diagonal elements pi. Just as with Bob, these 
diagonal elements represent the probabilitics of Alice experiencing the corresponding basis states 
and we associated them with the random variable A. 

(iii) Communication. Communication from Alice to Bob takes place by Alice encoding a 
message by preparing a physical system Q and Bob subsequently performing a generalized mea- 
surement on it. Thus the procedures of (i) and (ii) are combined, with the composite system 
QA2A now playing the role of the reference system R. Consequently A and B become entangled 
via A2QLM. The diagonal elements of their joint density matrix pab are now associated with the 
joint random variable AB. The mutual information between what Alice prepared and what Bob 
received is simply /(A; B). Thus it can be read off very simply from the joint density matrix of 
Alice and Bob. 



Directed entanglement and quantum channels. The theory hitherto presented suggests 
that all communication can be viewed as entanglement transfer. Initially the sender A is entangled 
with A2Q only. Gradually the entanglement is passed on through L, M and finally to B, the 
receiver. Intuition suggests that the final entanglement between A and B cannot exceed the initial 
entanglement between A and Q. In addition, one would expect Alice's and Bob's diagonal vision to 
further reduce their " experienced" entanglement. We now make these ideas concrete by introducing 
the notion of directed entanglement. 

Directed entanglement E(X — » Y) from the system X to the system Y is defined as 

E(X -»Y)= S(Y) - S(XY), (5) 

where S(Y) is short for the von Neumann entropy S(py) of the density matrix py, S(py) — 
—tr{py log py)- S(XY) is defined analogously. The connnection between E and entanglement was 
observed by Schumacher and Nielsen [[7] when it first appeared in the guise of coherent information 
I c . E(X — > Y) may also be viewed as the negative of the conditional entropy S(XJY) — S(XY) — 
S(Y), a quantity investigated in some detail, albeit without physical context, in M. 
We list some useful properties of E{X — > Y): 

(a) -S(X) < E(X -^Y)< S(X) 

(b) E(X -> Y) < E(X -> YZ) 

(c) E(XY -> Z) = E(X -> Z) + E(Y -> XZ) 

(d) E(X YZ) > E(X -> Y) + E(X -> Z) 
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(c) E(XY -> ZW) > E(X Z) + E(Y -> W) 

(f) £(X -> Y) > E(Z -> y c ) > E(X C -> y c ) 

(g) — » y) is invariant under local unitary transformations f/x ® C/y 

(h) < £(Jf -> Y~ c ) < 

We omit the proofs, many of which can be found in H. In (b) equality holds when Z is 
disentangled from XY, and we shall refer to this fact as property (b'). Similarly, in (e) equality 
holds when XZ is disentangled from YW, and we label this by (e'). In (f) and (h), X c refers to 
the classicized system X, stripped of its off-diagonal elements in some preferred basis. Note that 
S(X C ) = -ff(X). We now use these properties to prové the two main theorems of classical and 
quantum information processing: the Holevo bound and the quantum data processing inequality. 

Consider sending classical information over some noisy channel £ : pç — > S(pq). This differs 
from (iii) in that the system Q gets entangled with some unobserved environmcnt E (via some 
unitary transformation Uqe) between the preparation and measurement phases. The total system 
involved in the process is thus AA^QELMB. Just after the interaction with E the system AQ is 
described by the density matrix 

n 

PAQ = ^V^ïK>(*L£(IV'i)<V'í|g), 
i=l 

and hence 

E(A^Q)= X -H(A), 
where the Holevo quantity x is given by 



X 



= S(S(p Q )) -^PiS^dViX^lg)). 



Denoting by primes quantities calculated after the interaction with LMB, we have the following 
string of equalities and incqualitics 

E(A ->Q) = E(A -► QLMB) = E'(A -> QLMB) 
> E'(A -> B) > E'{A C -> B c ) = H(B) - íf(AB). 

The first four relations are due to (V), (g), (b), and (f) respectively. This gives rise to the Holevo 
bound 

J(A;B)< X . (6) 

Equality is asymptotically achieved by block coding in the limit of large blocklength || . 

One can also send quantum information over a noisy channel. Consider the physical system Q 
being sent through two noisy channcls £\ and £2 consecutively. Thus Q, initially entangled with the 
reference system R only, gets entangled first with E\ and then with E2 via some U ae x and Uae 2 7 
respectively. We denote by primes quantities calculated after the interaction with environment E\ 
and by double primes those calculated after the interaction with E2. Then we have, by (b), 

E"(R -> QE l E 2 ) > E"{R -» QE 2 ) > E"{R -> Q). 

Noting E"{R QE X E 2 ) = E(R QE X E 2 ) = E(R -> Q) and E"{R -> QE 2 ) = E'(R Q£; 2 ) = 
i?'(iï — * Q), both consequences of (g) and (b'), we get the quantum data processing inequality 

S(p Q ) > Ic{ P Q,£l) > I c (pq,£ 2 o£i), (7) 
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where I c is the coherent information |7|. Equality is again achieved asymptotically []9|. 

We have seen that the key quantum noisy channel relations for sending classical (JsJ) and quantum 
(0) information both follow from the properties of directed entanglement. [] We expect this to 
open avenues, e.g., for adapting the quantum channel coding techniques developed by Lloyd || 
to produce classical codes saturating the Holevo bound, just like Schumacher compression may be 
applied to classically prepared information. 



Quantum thermodynamics. With the developments of the previous sections it becomes clear 
that the thermodynamic entropy can only be defined relative to elements of IC. We define the 
thermodynamic entropy St(Q\B) of a system Q relative to B £ ICboò as 

S T {Q\B) = S(Q\B C ) = -E(Q -> B c ), (8) 

i.e. it is the negative of the directed entanglement from Q to the classicized system B. In terms of 
Standard quantum mechanics, it corresponds to the average von Neumann entropy of the system 
Q "as viewed by Bob" upon measuring or preparing it. By property (h), it lies between and 
S(Q), the upper bound attained when B is disentangled from Q. 

Of course, our definitions of the thermodynamic entropy must be justified by the zeroth and 
second laws of thermodynamics Jï(| . For proving the zeroth law it sufhces that the sum of ther- 
modynamic entropies of two initially unentangled systems cannot decrease when they interact 
(see |Ï0) for a full treatment). This, in turn, follows from properties (b) and (c), which give 
St(Qi\B) + St{Q"2,\B) > St(QiQ2\B), and noting that equality holds when Qi and Qi are unen- 
tangled. 

The second law is somewhat more subtle, and holds only for macroscopic systems. The Hilbert 
space of our macroscopic system Q may be divided into a tensor product Hq = Hq> ®Hq< , where 
Q > and Q < represent the macroscopic and microscopic degrees of freedom, respectively. The second 
law is a consequence of the lack of knowledge of the microstructure of the physical system, and 
the macrostructure being converted into microstructure as the system evolves Accordingly, 
it is the coarse-grained entropy, S^(Q\B) = St(Q > \B), i.e. the thermodynamic entropy of Q > , 
that increases. To prové this we introduce a dummy system B £ K,Bob which keeps track of 
Bob's (nonexistent) knowledge of Q > , and hence remains disentangled from QB at all times. 
Initially B is maximally entangled with Q > , as a consequence of preparation or measurement, 
and Q < has a uniform density matrix and is disentangled from BQ > . By property (e') we have 
E(Q>Q< -> B C B) = E(Q> ->■ B c ) + E(Q< -> B). Since Q is closed, it evolves via some unitary 
operator U Q > Q <, entangling Q> and Q<, so that E'(Q > Q < -> B C B) = E(Q>Q< -> B C B) by 
(g). Property (e) implies E'{Q > Q < -> B C B) >^E'(Q> -> B c ) + E'(Q< -> B), and hence (in 
obvious notation) AE(Q > — * B c ) + AE(Q < — » B) < 0. Since Q < initially had a uniform density 
matrix, it follows that AE(Q< -> B) > 0. Therefore AE(Q> -> B c ) < 0, i.e. S^(Q\B) has 
increased. There is, however, no a priori reason why S^(Q\B) should continue to increase since 
the density matrix of Q < is not uniform any more. At this point we invoke the idea, central to 
Wilson's renormalization group, that the many lengthscales of Q are locally coupled [Q. Hence 
we may further decompose the Hilbert space of Q K into spaces of increasing size, corresponding to 
decreasing lengthscales, Hq — 'Hq ®'Hq 1 ®· • ; in such a way that the time evolution entangles only 
neighboring spaces, causing a cascade effect. In the light of this principle, the above calculation 
is now modified by replacing Q < with Qq. Aftcr Q > and Qo interact, Qo will interact with the 
completely uniform (and larger in size) Qi, which will thoroughly randomize it. Thus, when Q > 
and Qo interact again, Q < starts off with a uniform density matrix so that S^(Q\B) continues to 
increase. The cascade continues, the vast microscopic degrees of freedom constantly radomizing 

1 It should be stressed that the proofs given are mathematically akin to already existing ones jïj , only reexpressed 
in the common language of entanglement transfer. 
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the more macroscopic ones, until thermodynamic cquilibrium is reached. Interestingly, the same 
mcthod works if we consider the second law to be a consequence of interactions with an unobserved 
environment, rather than coarse graining |Ï3|] . The microscopic degrees of freedom are, in essence, 
an unobserved environment. 

The model presented provides a simplified and unified view of physics and communication, with 
directed entanglement (|^) as the key binding quantity. As advertised, correlated random variables 
come about rather naturally, and drawbacks of the Copenhagen interpretation mentioned in the 
introduction are taken care of. Mixed density operators are always a consequence of entanglement, 
and both classical and quantum communication over quantum channels are seen as the transfer 
of entanglement. Correlated random variables describing the conceptual experience of various 
protagonists arise from the entanglement between different instances of knowledge, interpreted 
through diagonal vision. These instances of knowledge are always entangled via some physical 
system. In (i) Bob and Charlie experienced the same measurement result due to being entangled 
via Q. Alternatively, Charlie could have learned the measurement outcome by Bob communicating 
it to him, in which case they would get entangled indirectly via some physical system used to encode 
the message. Finally, quantum thermodynamic entropy is defined as the lack of entanglement 
between instances of knowledge and physical systems. In addition we have presented a simple and 
intuitive proof of the second law of thermodynamics. Thus the model promises to become a very 
practical framework for understanding and developing quantum information theory and quantum 
thermodynamics. On a philosophical lcvel it makes Bohr's insight that the wavefunction merely 
represents knowledge much more precise by relating it to the true carriers of Shannon information 
- random variables. 
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